CDS

Accession Number TCMCG004C03735
gbkey CDS
Protein Id XP_025611787.1
Location complement(join(102601282..102601366,102601500..102601642,102601728..102601811,102602232..102602291,102602377..102602457,102602627..102602740,102603020..102603092,102603229..102603392,102603567..102603760,102603867..102603942,102604021..102604358,102604808..102604895,102605037..102605117,102605709..102605849,102606019..102606075,102606185..102606407,102606538..102606697,102606807..102606876,102606967..102607013,102607110..102607128))
Gene LOC112705145
GeneID 112705145
Organism Arachis hypogaea

Protein

Length 765aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA476953
db_source XM_025756002.2
Definition general transcription and DNA repair factor IIH helicase subunit XPB1 [Arachis hypogaea]

EGGNOG-MAPPER Annotation

COG_category KL
Description DNA repair helicase
KEGG_TC -
KEGG_Module M00290        [VIEW IN KEGG]
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko00002        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
ko03021        [VIEW IN KEGG]
ko03400        [VIEW IN KEGG]
KEGG_ko ko:K10843        [VIEW IN KEGG]
EC 3.6.4.12        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko03022        [VIEW IN KEGG]
ko03420        [VIEW IN KEGG]
map03022        [VIEW IN KEGG]
map03420        [VIEW IN KEGG]
GOs GO:0005575        [VIEW IN EMBL-EBI]
GO:0005622        [VIEW IN EMBL-EBI]
GO:0005623        [VIEW IN EMBL-EBI]
GO:0005634        [VIEW IN EMBL-EBI]
GO:0005737        [VIEW IN EMBL-EBI]
GO:0008150        [VIEW IN EMBL-EBI]
GO:0009314        [VIEW IN EMBL-EBI]
GO:0009411        [VIEW IN EMBL-EBI]
GO:0009416        [VIEW IN EMBL-EBI]
GO:0009628        [VIEW IN EMBL-EBI]
GO:0043226        [VIEW IN EMBL-EBI]
GO:0043227        [VIEW IN EMBL-EBI]
GO:0043229        [VIEW IN EMBL-EBI]
GO:0043231        [VIEW IN EMBL-EBI]
GO:0044424        [VIEW IN EMBL-EBI]
GO:0044464        [VIEW IN EMBL-EBI]
GO:0050896        [VIEW IN EMBL-EBI]

Sequence

CDS:  
ATGGGACAACACGGACACGGTGATAAGGGCCGACCCTTCAAAAAGTTCAAGCCCTCTAACAAATTCGAAGATAGCAGCAAAAGAGGATTCGATGACGATGACGTTTATGGCGGCGACGATGCACACGACGAGGATGATGGCAAAGTCAAAGATTTTAGCAAATTAGAGCTGAAACCGGATCACCTCAATCGTCCTCTCTGGGCCTGTGGCAATGGCCGCATATTCCTTGAGACCTTCTCTCCTTTGTACAAGCAAGCTTATGATTTTCTTATTGCCATTGCCGAACCCGTTTGCAGGCCGGAGTCTATGCATGAATACAACCTTACACCACACTCGCTGTATGCTGCTGTTTCTGTTGGTCTGGAAACAGAAACTATCATATCCGTTTTGAACAAGTTATCAAAGACCAAGCTTCCCAAAGAGATGATTAGTTTCATACATGATTCCACTGCTAATTATGGTAAAGTGAAGCTGGTGCTCAAGAAGAATCGCTACTTTATTGAATCTCCATTTCCTGAGGTATTGAAGACATTGCTTAAAGATGAAGTCATATCTCGAGCAAGAATTACTTCTGAGGGTACAAATGGGGATGGATTTACAATTAGCAAAGCAGCAGGTGAAATTGAAGGCAGACATGACGAGTTGCTAAATGAAGCCGAGGTGGCAGCAGCTGCTGAAGAGAAAGAAACTCATGCTTTTGAAATTAATCCTTCTCAGGTTGAAAATGTAAAGCAACGGTGCTTGCCAAATGCATTAAATTATCCCATGTTGGAGGAGTATGATTTCAGAAATGATACAGTGAACCCTGACCTTGACATGGAACTAAAGCCTCAAGCACAACCACGACCTTATCAAGAGAAGAGCCTTAGCAAAATGTTTGGAAATGGTAGAGCAAGATCTGGTATAATAGTCCTGCCTTGTGGTGCTGGAAAGTCCCTGGTTGGTGTATCTGCAGCTAGCCGGATCAAGAAGAGTTGCCTTTGTTTGGCAACAAATGCTGTCTCTGTAGATCAGTGGGCTTTTCAGTTTAAACTATGGTCAACTATCCGAGAGGAAAATATTTGCCGTTTTACATCTGATAGCAAAGAGAGATTCCGTGGTAATGCTGGAGTTGTTGTGACAACATATAATATGGTTGCTTTTGGTGGTAAACGGTCTGAAGAATCTGAAAAGATCATTGAAGAAATAAGAAACAGAGAATGGGGATTACTCCTTATGGATGAGGTGCATGTGGTTCCAGCCCATATGTTTCGAAAAGTCATTAGTATCACTAAATCTCACTGCAAACTTGGGCTAACAGCTACACTTGTGAGAGAGGATGAAAGGATTACAGATCTCAACTTCCTAATTGGTCCCAAGCTGTATGAGGCAAATTGGTTAGACTTAGTAAAAGGTGGATTTATTGCAAATGTTCAGTGTGCTGAAGTATGGTGTCCAATGACAAGAGAGTTTTTTGCTGAGTATCTGAAGAAAGAGAATTCCAAGAAGAGGCAGGCACTTTATGTGATGAATCCAAATAAGTTCAGAGCTTGTGAATTTCTTATAAATTACCATGAAAGGGCACGTGGCGATAAAATTATTGTCTTTGCTGATAATCTTTTCGCTCTCACTGAGTATGCCATGAAACTCCGCAAACCAATGATATATGGTGCTACAAGCCATGTTGAGAGGACGAAAATTCTACAAGCATTCAAAACTAGCAAAGACGTCAACACTGTTTTTCTTTCGAAGGTGGGTGACAACTCGATTGATATTCCCGAAGCAAATGTGATAATTCAAATTTCTTCGCATGCTGGTTCTAGGCGTCAAGAAGCCCAGCGTCTAGGTCGTATTCTTAGGGCCAAGGGAAAGCTTCAGGATAGGATGGCAGGCGGTAAAGAAGAGTATAATGCATTTTTTTATTCCCTTGTCTCAACTGATACCCAGGAGATGTATTACTCGACTAAAAGGCAACAATTTTTAATTGATCAGGGTTATAGCTTTAAGGTAATTACAAGCTTGCCTCCACCTGATGAAGGGCCGCGATTGAGCTATCATCATCTTGATGATCAACTTGCACTTCTTTCAAAGGTATTGAGTGCTGGTGATGATGCAGTGGGGCTAGAACAGCTAGAAGAAGATACAGATGAAATAGCTCTCCGCCATGCCCGTCGTTCTCAAGGATCAATGAGTGCAATGTCAGGTGCAAAGGGGATGGTTTACATGGAGTACAGTACTGGCAAAGGCAAAGCACCTGTTAAGAGCAAGCCAAAAGACCCATCAAAGAGACACCACTTATTCAGAAAGCGATTTGGTTGA
Protein:  
MGQHGHGDKGRPFKKFKPSNKFEDSSKRGFDDDDVYGGDDAHDEDDGKVKDFSKLELKPDHLNRPLWACGNGRIFLETFSPLYKQAYDFLIAIAEPVCRPESMHEYNLTPHSLYAAVSVGLETETIISVLNKLSKTKLPKEMISFIHDSTANYGKVKLVLKKNRYFIESPFPEVLKTLLKDEVISRARITSEGTNGDGFTISKAAGEIEGRHDELLNEAEVAAAAEEKETHAFEINPSQVENVKQRCLPNALNYPMLEEYDFRNDTVNPDLDMELKPQAQPRPYQEKSLSKMFGNGRARSGIIVLPCGAGKSLVGVSAASRIKKSCLCLATNAVSVDQWAFQFKLWSTIREENICRFTSDSKERFRGNAGVVVTTYNMVAFGGKRSEESEKIIEEIRNREWGLLLMDEVHVVPAHMFRKVISITKSHCKLGLTATLVREDERITDLNFLIGPKLYEANWLDLVKGGFIANVQCAEVWCPMTREFFAEYLKKENSKKRQALYVMNPNKFRACEFLINYHERARGDKIIVFADNLFALTEYAMKLRKPMIYGATSHVERTKILQAFKTSKDVNTVFLSKVGDNSIDIPEANVIIQISSHAGSRRQEAQRLGRILRAKGKLQDRMAGGKEEYNAFFYSLVSTDTQEMYYSTKRQQFLIDQGYSFKVITSLPPPDEGPRLSYHHLDDQLALLSKVLSAGDDAVGLEQLEEDTDEIALRHARRSQGSMSAMSGAKGMVYMEYSTGKGKAPVKSKPKDPSKRHHLFRKRFG